Binaural Cepstrum Coefficient and Its Application to Ground Target Recognition

نویسندگان

  • Guan Luyang
  • Tian Jing
چکیده

Stereausis is a biologically motivated model proposed by Shamma which encodes both binaural and spectral information in a unified framework to simulate the processing of human binaural auditory system. In this paper, a new type of cepstrum coefficient is proposed based on this model. Two-channel acoustic signals are first processed by the stereausis binaural model to synthesize the spectral information and reduce the interference of noise signal. The binaural cepstrum coefficient is then extracted based on the diagonal vector of the stereausis model's output pattern, and is applied as feature to the multi-class acoustic target recognition. Learning Vector Quantization (LVQ) algorithm is implemented as the classifier and is tested by samples of vehicle acoustic signals. Experimental results show that binaural cepstrum coefficient improves both the performance and generalization of the classifier, especially at low SNR.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System

We present a practical and noise-robust speech recognition system which estimates a target-to-interferers power ratio using a zero-crossing-based binaural model and applies the power ratio to a channel attentive missing feature decoder in the cepstral domain. In a natural multisource environment, our binaural model extracts spatial cues at each zero-crossing of a filterbank output signal to loc...

متن کامل

Classification Of Ground Moving Object Using Coefficient Of Integrated Bispectrum For Doppler Radar

This paper considers the classification of radar target using Backscatter Doppler signature of moving object. Classification performance evaluated by the integrated Bispectrum based technique of feature extraction and compared it with Cepstrum based feature extraction technique. Classifier performance is tested by GMM (Gaussian Mixture Model) and ML (Maximal Likelihood) decision making method. ...

متن کامل

MFCC and its applications in speaker recognition

Speech processing is emerged as one of the important application area of digital signal processing. Various fields for research in speech processing are speech recognition, speaker recognition, speech synthesis, speech coding etc. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Feature extraction is the first step ...

متن کامل

Approaches for Automatic Speaker Recognition in a Binaural Humanoid Context

This paper presents two methods of Automatic Speaker Recognition (ASkR). ASkR has been largely studied in the last decades, but in most cases in mono-microphone or microphone array contexts. Our systems are placed in a binaural humanoid context where the signals captured by both ears of a humanoid robot will be exploited to perform the ASkR. Both methods use Mel-Frequency Cepstral Coding (MFCC)...

متن کامل

Acoustic Scene Classification Using Spatial Features

Due to various factors, the vast majority of the research in the field of Acoustic Scene Classification has used monaural or binaural datasets. This paper introduces EigenScape a new dataset of 4th-order Ambisonic acoustic scene recordings and presents preliminary analysis of this dataset. The data is classified using a standard Mel-Frequency Cepstral Coefficient Gaussian Mixture Model system, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007